library(tidyverse)
library(writexl)
library(plyr)
----------------------------------------------------------------------------------------------------------------------------------
You have loaded plyr after dplyr - this is likely to cause problems.
If you need functions from both plyr and dplyr, please load plyr first, then dplyr:
library(plyr); library(dplyr)
----------------------------------------------------------------------------------------------------------------------------------
Attaching package: ‘plyr’
The following objects are masked from ‘package:dplyr’:
arrange, count, desc, failwith, id, mutate, rename, summarise, summarize
The following object is masked from ‘package:purrr’:
compact
library(lubridate)
Attaching package: ‘lubridate’
The following objects are masked from ‘package:base’:
date, intersect, setdiff, union
library(plotly)
Registered S3 method overwritten by 'data.table':
method from
print.data.table
Registered S3 method overwritten by 'htmlwidgets':
method from
print.htmlwidget tools:rstudio
Attaching package: ‘plotly’
The following objects are masked from ‘package:plyr’:
arrange, mutate, rename, summarise
The following object is masked from ‘package:ggplot2’:
last_plot
The following object is masked from ‘package:stats’:
filter
The following object is masked from ‘package:graphics’:
layout
Exchange rate and sentiment: is there a connection?
Before we can start with our machine learning model we need to understand the relationship between the two variables, therefore we should calculate covariance. This measures the direction of a relationship between the two variables.
First step: creating a dataframe from the csv
btc_exchange_rate_history <- read.csv("D:/Suli/Szakdolgozat1/development_n_stuff/aggregated_data.csv") %>%
select(-X) %>%
mutate(Date = as_date(Date))
Error in `mutate()`:
! Problem while computing `Date = as_date(Date)`.
Caused by error in `as.Date.default()`:
! do not know how to convert 'x' to class “Date”
Backtrace:
1. ... %>% mutate(Date = as_date(Date))
8. lubridate::as_date(Date)
10. base::as.Date.default(x, ...)
11. base::stop(...)
Second step: plotting the data on a scatterplot
todo: exchange price changehez nézni, nem az árhoz
btc_sent_plot
No trace type specified:
Based on info supplied, a 'scatter' trace seems appropriate.
Read more about this trace type -> https://plotly.com/r/reference/#scatter
No scatter mode specifed:
Setting the mode to markers
Read more about this attribute -> https://plotly.com/r/reference/#scatter-mode
No trace type specified:
Based on info supplied, a 'scatter' trace seems appropriate.
Read more about this trace type -> https://plotly.com/r/reference/#scatter
No scatter mode specifed:
Setting the mode to markers
Read more about this attribute -> https://plotly.com/r/reference/#scatter-mode
Third step: calculate covariance and correlation
btc_cov
[1] 21506.41
A positive covariance means that the two variables tend to increase or decrease together. Correlation helps us analyze the effect of changes made in one variable over the other variable of the dataset. Now that we know this, we should calculate the strength of the relationship between two, numerically measured, continuous variables.
btc_cor
[1] 0.1836311
LS0tDQp0aXRsZTogIlBsb3RzIGFuZCBncmFwaHMgZm9yIHRoZXNpcyINCm91dHB1dDogaHRtbF9ub3RlYm9vaw0KLS0tDQpgYGB7cn0NCmxpYnJhcnkodGlkeXZlcnNlKQ0KbGlicmFyeSh3cml0ZXhsKQ0KbGlicmFyeShwbHlyKQ0KbGlicmFyeShsdWJyaWRhdGUpDQpsaWJyYXJ5KHBsb3RseSkNCmBgYA0KDQoNCiMjIEV4Y2hhbmdlIHJhdGUgYW5kIHNlbnRpbWVudDogaXMgdGhlcmUgYSBjb25uZWN0aW9uPw0KQmVmb3JlIHdlIGNhbiBzdGFydCB3aXRoIG91ciBtYWNoaW5lIGxlYXJuaW5nIG1vZGVsIHdlIG5lZWQgdG8gdW5kZXJzdGFuZCB0aGUgcmVsYXRpb25zaGlwIGJldHdlZW4gdGhlIHR3byB2YXJpYWJsZXMsIHRoZXJlZm9yZSB3ZSBzaG91bGQgY2FsY3VsYXRlIGNvdmFyaWFuY2UuIFRoaXMgbWVhc3VyZXMgdGhlIGRpcmVjdGlvbiBvZiBhIHJlbGF0aW9uc2hpcCBiZXR3ZWVuIHRoZSB0d28gdmFyaWFibGVzLg0KDQojIEZpcnN0IHN0ZXA6IGNyZWF0aW5nIGEgZGF0YWZyYW1lIGZyb20gdGhlIGNzdg0KYGBge3J9DQpidGNfZXhjaGFuZ2VfcmF0ZV9oaXN0b3J5IDwtIHJlYWQuY3N2KCJEOi9TdWxpL1N6YWtkb2xnb3phdDEvZGV2ZWxvcG1lbnRfbl9zdHVmZi9hZ2dyZWdhdGVkX2RhdGEuY3N2IikgJT4lDQogICAgICAgICAgICAgICAgICAgICAgICAgICAgIHNlbGVjdCgtWCkgJT4lDQogICAgICAgICAgICAgICAgICAgICAgICAgICAgIG11dGF0ZShjcmVhdGVkX2F0ID0gYXNfZGF0ZShjcmVhdGVkX2F0KSkNCg0KDQoNCmJ0Y191c2RfdHdlZXRzX2NvbWJpbmVkIDwtIHJlYWQuY3N2KCJEOi9TdWxpL1N6YWtkb2xnb3phdDEvZGF0YV90b19iZV9jbGVhbmVkL2NvaW5fQml0Y29pbi5jc3YiKSAlPiUNCiAgICAgICAgICAgICAgICAgICAgICAgICAgIG11dGF0ZShEYXRlID0gYXNfZGF0ZXRpbWUoRGF0ZSkpICU+JQ0KICAgICAgICAgICAgICAgICAgICAgICAgICAgbXV0YXRlKHllYXIgPSB5ZWFyKERhdGUpLCBtb250aCA9IG1vbnRoKERhdGUpLCBkYXkgPSBkYXkoRGF0ZSkpICU+JQ0KICAgICAgICAgICAgICAgICAgICAgICAgICAgbXV0YXRlKERhdGUgPSBtYWtlX2RhdGUoeWVhcixtb250aCxkYXkpKSAlPiUgDQogICAgICAgICAgICAgICAgICAgICAgICAgICBmaWx0ZXIoRGF0ZSA+PSAiMjAxOS0wMS0wMSIgJiBEYXRlIDw9ICIyMDIyLTAzLTMwIikgJT4lDQogICAgICAgICAgICAgICAgICAgICAgICAgICBpbm5lcl9qb2luKGJ0Y19leGNoYW5nZV9yYXRlX2hpc3RvcnksIGJ5ID0gYygiRGF0ZSIgPSAiY3JlYXRlZF9hdCIpKSAlPiUgDQogICAgICAgICAgICAgICAgICAgICAgICAgICBzZWxlY3QoYygteWVhciwtbW9udGgsLWRheSkpDQoNCmBgYA0KDQpgYGB7cn0NCmJ0Y19zZW50X2xpbmVwbG90IDwtIHBsb3RfbHkoZGF0YSA9IGJ0Y191c2RfdHdlZXRzX2NvbWJpbmVkLCB4ID0gfkRhdGUsIHkgPSB+Q2xvc2UsIG5hbWUgPSdleGNoYW5nZSByYXRlJywgdHlwZSA9ICdzY2F0dGVyJywgbW9kZSA9ICdsaW5lcycpDQoNCmJ0Y19zZW50X2xpbmVwbG90IDwtIGJ0Y19zZW50X2xpbmVwbG90ICU+JSBhZGRfdHJhY2UoeSA9IH5kYWlseV9hdmdfc2VudCwgbmFtZSA9ICdhdmcuIHNlbnRpbWVudCcsIG1vZGUgPSAnbGluZXMnKSANCg0KYnRjX3NlbnRfbGluZXBsb3QNCmBgYA0KDQojIFNlY29uZCBzdGVwOiBwbG90dGluZyB0aGUgZGF0YSBvbiBhIHNjYXR0ZXJwbG90DQp0b2RvOiBleGNoYW5nZSBwcmljZSBjaGFuZ2VoZXogbsOpem5pLCBuZW0gYXogw6FyaG96DQoNCg0KYGBge3J9DQpidGNfc2VudF9zY2F0dGVycGxvdCA8LSBwbG90X2x5KGRhdGEgPSBidGNfdXNkX3R3ZWV0c19jb21iaW5lZCwgeSA9ICB+ZGFpbHlfYXZnX3NlbnQsIHggPSB+Q2xvc2UsDQogICAgICAgICAgICAgICAgICAgICAgICAgICAgbWFya2VyID0gbGlzdChzaXplID0gNCwNCiAgICAgICAgICAgICAgICAgICAgICAgICAgICAgY29sb3IgPSAncmdiYSgyNTUsIDE4MiwgMTkzLCAuOSknLA0KICAgICAgICAgICAgICAgICAgICAgICAgICAgICBsaW5lID0gbGlzdChjb2xvciA9ICdyZ2JhKDE1MiwgMCwgMCwgLjgpJywgd2lkdGggPSAxKSkpDQoNCmJ0Y19zZW50X3NjYXR0ZXJwbG90DQpgYGANCg0KIyBUaGlyZCBzdGVwOiBjYWxjdWxhdGUgY292YXJpYW5jZSBhbmQgY29ycmVsYXRpb24NCg0KYGBge3J9DQpidGNfY292IDwtIGNvdihidGNfdXNkX3R3ZWV0c19jb21iaW5lZCRkYWlseV9hdmdfc2VudCxidGNfdXNkX3R3ZWV0c19jb21iaW5lZCRDbG9zZSwgbWV0aG9kID0gInNwZWFybWFuIikNCg0KYnRjX2Nvdg0KDQpgYGANCkEgcG9zaXRpdmUgY292YXJpYW5jZSBtZWFucyB0aGF0IHRoZSB0d28gdmFyaWFibGVzIHRlbmQgdG8gaW5jcmVhc2Ugb3IgZGVjcmVhc2UgdG9nZXRoZXIuIA0KQ29ycmVsYXRpb24gaGVscHMgdXMgYW5hbHl6ZSB0aGUgZWZmZWN0IG9mIGNoYW5nZXMgbWFkZSBpbiBvbmUgdmFyaWFibGUgb3ZlciB0aGUgb3RoZXIgdmFyaWFibGUgb2YgdGhlIGRhdGFzZXQuDQpOb3cgdGhhdCB3ZSBrbm93IHRoaXMsIHdlIHNob3VsZCBjYWxjdWxhdGUgdGhlIHN0cmVuZ3RoIG9mIHRoZSByZWxhdGlvbnNoaXAgYmV0d2VlbiB0d28sIG51bWVyaWNhbGx5IG1lYXN1cmVkLCBjb250aW51b3VzIHZhcmlhYmxlcy4NCg0KYGBge3J9DQpidGNfY29yIDwtIGNvcihidGNfdXNkX3R3ZWV0c19jb21iaW5lZCRkYWlseV9hdmdfc2VudCxidGNfdXNkX3R3ZWV0c19jb21iaW5lZCRDbG9zZSwgbWV0aG9kPSJzcGVhcm1hbiIpDQoNCmJ0Y19jb3INCmBgYA0KDQoNCg==